Skip to content

Conversation

@debayang
Copy link

@debayang debayang commented May 4, 2018

This ports the random hash optimization done for lj_new_str() for x64 ( in 7923c63 ) to arm64.

Verified on Arm64 platform with the existing benchmark developed as part of the x64 commit above - shows similar improvements as seen for x64 with this benchmark.

However not sure about the effect on any real time workloads.

lj_str_indep_hash(GCstr *str) {
return lj_str_original_hash(strdata(str), str->len);
}

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

style: needs 2 blank lines here like before.

CCOPT_x64=
CCOPT_arm=
CCOPT_arm64=
CCOPT_arm64= -march=armv8-a+crc
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I wonder if this breaks compatibility with other arm64 variants?

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The "+crc" option is needed to inform the compiler that crc instruction valid for this arch.
The proper fix might be to check if crc support is available in hardware at runtime .
Modified the code to take care of above.

v = (v << 8) | len;
return __crc32cw(h, v);
}

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

IIRC, on X86-64, if the string being hashed having length under 4, original hash function seem to be working better.

Other than that, LGTM.

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

As per my tests on ARM64 platform the micro benchmark did not show any noticeable differences for below 4 bytes.

@agentzh
Copy link
Member

agentzh commented Aug 2, 2019

@siddhesh Will you mind having a quick look at this one? ;) Many thanks!

Copy link
Collaborator

@siddhesh siddhesh left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There isn't a lot of difference in algorithms between the x86_64 and arm64 versions, so it is probably a much better idea to move lj_str_hash_x64.h as lj_str_hash.h and then just add #defines to define the various crc32 macros and the additional platform feature check for arm64.

That will make sure that future maintenance of these functions is easy.

LJ_STR_HASH = lj_str_hash;
}
else {
LJ_STR_HASH = lj_str_original_hash;
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggest: Initialize LJ_STR_HASH to lj_str_hash and only if HWCAP_CRC32 is not set, reset it to lj_str_original_hash over here.

@siddhesh siddhesh mentioned this pull request Aug 6, 2019
@agentzh
Copy link
Member

agentzh commented Sep 2, 2019

@siddhesh Thanks for looking into this! The original author of this PR is no longer responding it seems. Will you please create a separate PR so that we can merge the final version quickly? Many thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants